fix by awni · Pull Request #2701 · ml-explore/mlx

awni · 2025-10-23T21:43:35Z

No description provided.

* Add quantize/dequantize slow path for mxfp8 and nvfp4 * fast cuda kernel for mx/nv quantization * fallback for cuda < 12.8 (#2697) * format (#2700) * fix (#2701) * metal kernels * docs * fix jit * add default bits and group sizes * improve quant docs * fix output type of mxfp4 matmuls

fix

275a4a8

awni merged commit 16e2af9 into ml-explore:mxfp8_and_nvfp4 Oct 23, 2025
0 of 2 checks passed

awni pushed a commit that referenced this pull request Oct 27, 2025

fix (#2701)

6959732

awni deleted the mxfp8_and_nvfp4 branch November 1, 2025 20:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix#2701

fix#2701
awni merged 1 commit intoml-explore:mxfp8_and_nvfp4from
awni:mxfp8_and_nvfp4

awni commented Oct 23, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

awni commented Oct 23, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant